Bibliography
[207] Shibani Santurkar, Dimitris Tsipras, Andrew Ilyas, and Aleksander Madry. How
does batch normalization help optimization? In Proceedings of Advances in neural
information processing systems, pages 1–11, 2018.
[208] Sheng Shen, Zhen Dong, Jiayu Ye, Linjian Ma, Zhewei Yao, Amir Gholami, Michael W.
Mahoney, and Kurt Keutzer. Q-BERT: Hessian based ultra low precision quantization
of BERT. In Proceedings of the AAAI Conference on Artificial Intelligence, 2020.
[209] Sheng Shen, Zhen Dong, Jiayu Ye, Linjian Ma, Zhewei Yao, Amir Gholami, Michael W.
Mahoney, and Kurt Keutzer. Q-BERT: Hessian based ultra low precision quantization
of BERT. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34,
pages 8815–8821, 2020.
[210] Ravid Shwartz-Ziv and Naftali Tishby. Opening the black box of deep neural networks
via information. arXiv:1703.00810, 2017.
[211] Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-
scale image recognition. In Proceedings of the International Conference on Learning
Representations, pages 1–15, 2015.
[212] Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, and Qi Tian. Pose-
driven deep convolutional model for person re-identification. In Proceedings of the
IEEE International Conference on Computer Vision, pages 3960–3969, 2017.
[213] Chi Su, Shiliang Zhang, Junliang Xing, Wen Gao, and Qi Tian. Deep attributes driven
multi-camera person re-identification. In Proceedings of the European Conference on
Computer Vision, pages 475–491, 2016.
[214] Yumin Suh, Jingdong Wang, Siyu Tang, Tao Mei, and Kyoung Mu Lee. Part-aligned
bilinear representations for person re-identification. In Proceedings of the European
Conference on Computer Vision, pages 402–419, 2018.
[215] Shengyang Sun, Changyou Chen, and Lawrence Carin. Learning structured weight
uncertainty in Bayesian neural networks. In Proceedings of the Artificial Intelligence
and Statistics, pages 1283–1292, 2017.
[216] Shengyang Sun, Guodong Zhang, Jiaxin Shi, and Roger Grosse. Functional variational
Bayesian neural networks. In Proceedings of the International Conference on Learning
Representations, pages 1–22, 2019.
[217] Siqi Sun, Yu Cheng, Zhe Gan, and Jingjing Liu. Patient knowledge distillation for
BERT model compression. arXiv preprint arXiv:1908.09355, 2019.
[218] Siyang Sun, Yingjie Yin, Xingang Wang, De Xu, Wenqi Wu, and Qingyi Gu. Fast
object detection based on binary deep convolution neural networks. CAAI transactions
on intelligence technology, 3(4):191–197, 2018.
[219] Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. Beyond part models:
Person retrieval with refined part pooling (and a strong convolutional baseline). In
Proceedings of the European Conference on Computer Vision, pages 480–496, 2018.
[220] Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir
Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going
deeper with convolutions. In Proceedings of the IEEE conference on computer
vision and pattern recognition, pages 1–9, 2015.